Comparative Performance Analysis of Feature(S)-Classifier Combination for Devanagari Optical Character Recognition System
نویسندگان
چکیده
this paper presents a comparative performance analysis of feature(s)-classifier combination for Devanagari optical character recognition system. For performance evaluation, three classifiers namely support vector machines, artificial neural networks and k-nearest neighbors, and seven feature extraction approaches viz. profile direction codes, transition, zoning, directional distance distribution, Gabor filter, discrete cosine transform and gradient features have been used. The first four features have been used jointly as statistical features. The performance has also been evaluated by using the combination of these feature extraction approaches. In addition, performance evaluation has also been done by varying the feature vector length of Gabor and DCT features. For training the classifiers, 7000 samples of first 70 classes (out of 942 classes), recognized in the earlier work have been used. Such a large number of classes are due to the horizontal and vertical fusion/overlapping characters. We have chosen first 70 classes as their percentage contribution out of 942 classes has found to be 96.69%. For testing, 1400 samples have been collected separately. A corpus of 25 books has been used for sample collection. Classifiers trained on different features, have been compared for performance evaluation. It has been found that support vector machines trained with Gradient features provide the classification correctness of 99.429%, and there is no significant increase in the performance with the increase in the feature vector length. Keywords—Artificial Neural Network; DCT; Directional Distance Distribution; Feature extraction, Gabor; k-Nearest Neighbour; Profile direction codes; Support Vector Machines; Transition; Zoning
منابع مشابه
Zernike Moment Feature Extraction for Handwritten Devanagari (Marathi) Compound Character Recognition
Compound character recognition of Devanagari script is one of the challenging tasks since the characters are complex in structure and can be modified by writing combination of two or more characters. These compound characters occurs 12 to 15% in the Devanagari Script. The moment based techniques are being successfully applied to several image processing problems and represents a fundamental too...
متن کاملHandwritten Devanagari Character Recognition Using Gradient Features
We describe novel methods of feature extraction for recognition of single isolated Devanagari character images. Our approach is flexible in that the same algorithms can be used, without modification, for feature extraction in a variety of OCR problems. These include handwritten, machine-print, grayscale, and binary and low-resolution character recognition. We use the gradient representation as ...
متن کاملHandwritten and Printed Devanagari Compound using Multiclass SVM Classifier with Orthogonal moment Feature
Handwritten Devanagari character plays a vital role in the research area. Number of technique has been adopted in previous decades and still some new are arising to get good results from recognition system. In Devanagari, Compound character are complex in structure, they are written by combination two or more character. Due to complex structure in it, it gives a challenging task to the research...
متن کاملHandwritten Devanagari (Marathi) Compound Character Recognition using Seventh Central Moment
Compound character recognition of Devanagari Script (Marathi language) is one of the challenging tasks since the compound character is combination of one or more characters. These characters can be treated as fusion of two or more characters and hence these are complex in structure. Marathi, Hindi, Sanskrit and Nepali are written with Devanagari script. All these languages have compound charact...
متن کاملOptical Character Recognition for Isolated Offline Handwritten Devanagari Numerals Using Wavelets
This paper presents a method of recognition of isolated offline handwritten Devanagari numerals using wavelets and neural network classifier. This method of optical character recognition takes the handwritten numeral image as input. After pre-processing, it is subjected to single level wavelet decomposition using Daubechies-4 wavelet filter. This wavelet decomposition allows viewing the input n...
متن کامل